[SPARK-6486] [MLlib] [Python] Add BlockMatrix to PySpark.#7761
dusenberrymw wants to merge 7 commits into apache:master
|
@dusenberrymw Did you coordinate with @MechCoder about this? I thought he already started working on it. |
|
Test build #38916 has finished for PR 7761 at commit
|
|
@mengxr We've been discussing things on PR #7554, but I wasn't sure if Manoj was working on this one. It's a pretty natural extension of 7554, so I already had most of it completed. @MechCoder Apologies if we were both working on this one. I've appreciated the thoughts on #7554, and I'd definitely love any thoughts on this one as well. |
|
@dusenberrymw Next time, you should check and ping the JIRA page first before you start coding to avoid duplicate work. |
|
@mengxr Yes, certainly. As soon as you merge #7746, I can finish #7554 and this PR. I'd also like to work on SPARK-6488. |
|
Test build #38937 has finished for PR 7761 at commit
|
|
Jenkins, retest this please. |
|
Test build #38955 has finished for PR 7761 at commit
|
|
Test build #150 has finished for PR 7761 at commit
|
|
@dusenberrymw You could have pinged me on the JIRA, since I already started work on this. (But no hard feelings) I can give a pass on this after I get back access to my machine on Saturday. |
|
@MechCoder Yes, apologies again for not realizing you were also working on this. I look forward to your thoughts! |
…logic to match that of the other distributed matrices, adding conversions, and adding documentation.
9a813d3 to b8acc1c
|
Test build #39808 has finished for PR 7761 at commit
|
|
Test build #39834 has finished for PR 7761 at commit
|
mengxr This adds the `BlockMatrix` to PySpark. I have the conversions to `IndexedRowMatrix` and `CoordinateMatrix` ready as well, so once PR #7554 is completed (which relies on PR #7746), this PR can be finished.

Author: Mike Dusenberry <mwdusenb@us.ibm.com>

Closes #7761 from dusenberrymw/SPARK-6486_Add_BlockMatrix_to_PySpark and squashes the following commits:

27195c2 [Mike Dusenberry] Adding one more check to _convert_to_matrix_block_tuple, and a few minor documentation changes.
ae50883 [Mike Dusenberry] Minor update: BlockMatrix should inherit from DistributedMatrix.
b8acc1c [Mike Dusenberry] Moving BlockMatrix to pyspark.mllib.linalg.distributed, updating the logic to match that of the other distributed matrices, adding conversions, and adding documentation.
c014002 [Mike Dusenberry] Using properties for better documentation.
3bda6ab [Mike Dusenberry] Adding documentation.
8fb3095 [Mike Dusenberry] Small cleanup.
e17af2e [Mike Dusenberry] Adding BlockMatrix to PySpark.

(cherry picked from commit 34dcf10)
Signed-off-by: Xiangrui Meng <meng@databricks.com> |
|
LGTM. Merged into master and branch-1.5. Thanks! |
|
Great, thanks @mengxr! |
@mengxr This adds the `BlockMatrix` to PySpark. I have the conversions to `IndexedRowMatrix` and `CoordinateMatrix` ready as well, so once PR #7554 is completed (which relies on PR #7746), this PR can be finished.
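
For readers unfamiliar with the data layout under discussion, here is a minimal plain-Python sketch of what a block-partitioned matrix looks like and how it could be flattened into coordinate `(i, j, value)` entries. It does not use Spark, and it is not the code from this PR: the function name `blocks_to_entries`, its parameters, and the choice to skip zero values are assumptions made purely for illustration.

```python
# Plain-Python sketch of a block-partitioned matrix; no Spark required.
# Every name below is hypothetical and chosen for illustration only;
# none of this is the PySpark API added by the PR.

def blocks_to_entries(blocks, rows_per_block, cols_per_block):
    """Flatten ((block_row, block_col), sub_matrix) pairs into (i, j, value) entries.

    Each sub_matrix is a list of rows. Zero values are skipped in this
    sketch, the way a sparse coordinate format typically stores data.
    """
    entries = []
    for (block_row, block_col), sub_matrix in blocks:
        # Global position of the block's top-left element.
        row_offset = block_row * rows_per_block
        col_offset = block_col * cols_per_block
        for r, row in enumerate(sub_matrix):
            for c, value in enumerate(row):
                if value != 0:
                    entries.append((row_offset + r, col_offset + c, value))
    return sorted(entries)


# A 4 x 2 matrix stored as two 2 x 2 blocks stacked vertically.
blocks = [
    ((0, 0), [[1, 2], [3, 4]]),
    ((1, 0), [[5, 0], [0, 6]]),
]

entries = blocks_to_entries(blocks, rows_per_block=2, cols_per_block=2)

# Overall dimensions recovered from the largest indices seen.
num_rows = max(i for i, _, _ in entries) + 1
num_cols = max(j for _, j, _ in entries) + 1

print(entries)
print(num_rows, num_cols)
```

The key idea is that each block carries its own grid position, so a global index is just the block offset plus the position inside the block. In a distributed setting the list of blocks would be an RDD rather than a Python list.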